Frame level likelihood normalization for text-independent speaker identification using Gaussian mixture models
نویسندگان
چکیده
In this paper we propose a new speaker identi cation system, where the likelihood normalization technique, widely used for speaker veri cation, is introduced. In the new system, which is based on Gaussian Mixture Models, every frame of the test utterance is inputed to all the reference models in parallel. In this procedure, for each frame, likelihoods from all the models are available, hence they can be normalized at every frame. A special kind of likelihood normalization, called Weighting Models Rank, is also proposed. Experiments were performed using two databases TIMIT and NTT. Evaluation results clearly show that frame level likelihood normalization technique is superior to the standard accumulated likelihood approach.
منابع مشابه
Text-independent speaker recognition using non-linear frame likelihood transformation
When the reference speakers are represented by Gaussian mixture model (GMM), the conventional approach is to accumulate the frame likelihoods over the whole test utterance and compare the results as in speaker identi®cation or apply a threshold as in speaker veri®cation. In this paper we describe a method, where frame likelihoods are transformed into new scores according to some non-linear func...
متن کاملSpeaker verification using frame and utterance level likelihood normalization
In this paper, we propose a new method, where the likelihood normalization technique is applied at both the frame and utterance levels. In this method based on Gaussian Mixture Models (GMM), every frame of the test utterance is inputed to the claimed and all background speaker models in parallel. In this procedure, for each frame, likelihoods from all the background models are available, hence ...
متن کاملText-Independent Speaker Recognition Using Gaussian Mixture Models Final Term Paper Proposal
The proposed project is an implementation of speaker recognition systems, both identification and verification. The systems are built using Gaussian Mixture Models, as proposed in several papers from Douglas A. Reynolds. The use of Fractional Covariance Matrix is studied as an possible increase for the traditional recognition systems. keywords: speaker recognition; Gaussian Mixture Models; like...
متن کاملInternational Journal of Emerging trends in Engineering and Development ISSN 2249-6149 Available online on http://www.rspublication.com/ijeted/ijeted_index.htm Issue 2, Vol.5 (July 2012)
In This paper presents an overview of a state-of-the-art text-independent speaker verification system. First, an introduction proposes a modular scheme of the training and test phases of a speaker verification system. Then, the most commonly speech parameterization used in speaker verification, namely, cepstral analysis, is detailed. Gaussian mixture modeling, which is the speaker modeling tech...
متن کاملDiscriminative training of GMM using a modified EM algorithm for speaker recognition
In this paper, we present a new discriminative training method for Gaussian Mixture Models (GMM) and its application for the text-independent speaker recognition. The objective of this method is to maximize the frame level normalized likelihoods of the training data. That is why we call it the Maximum Normalized Likelihood Estimation (MNLE). In contrast to other discriminative algorithms, the o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996